Pleasant extensions retaining algebraic 
structure, I 



Tim Austin 



Abstract 



In the recent papers f4l| and f3\ we introduced some new techniques for 
constructing an extension of a probability-preserving system T : l/- r\ 
(X, /i) that enjoys certain desirable properties in connexion with the asymp- 
totic behaviour of some related nonconventional ergodic averages. 

The present paper is the first of two that will explore various refinements 
and extensions of these ideas. This first part is dedicated to some much more 
general machinery for the construction of extensions that can be used to re- 
cover the results of [4, 3|. It also contains two relatively simple new appli- 
cations of this machinery to the study of certain families of nonconventional 
averages, one in discrete and one in continuous time (convergence being a 
new result for the latter). 

In the forthcoming second part |2| we will introduce the problem of de- 
scribing the characteristic factors and the limit of the linear nonconventional 
averages 



when the directions pi, P2, G 1/^ are not assumed to be linearly 

independent, and provide a fairly detailed solution in the case when fc = 3, 
d ~ 2 and any pair of directions is linearly independent. This will then be 
used to prove the convergence in of the quadratic nonconventional 

averages 
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1 Introduction 

Suppose that Ti, T2, . . . , Tk {X, fi) is a system of commuting invertible trans- 
formations on a standard Borel probability space. To such a system we can asso- 
ciate various 'nonconventional' ergodic averages, such as the averages 

N k 
n=l i=l 

or their more complicated relatives of the form 

N k 
n=l i=l 

for an action T : r\ {X, fi) and polynomial mappings pi : TL — > Zf^ for 
z = 1, 2, . . . , (we sometimes refer to these as 'linear' and 'polynomial' averages 
respectively). 

The linear averages have been the subject of considerable study since they first 
emerged from Furstenberg's ergodic theoretic approach to Szemeredi's Theorem 
and generalizations llT4llT5l . and more recently their polynomial relatives have re- 
ceived similar attention since the extension of Furstenberg's work to an appropriate 
polynomial setting ||5l|6l. These applications to Arithmetic Ramsey Theory typ- 
ically require that certain related scalar averages 'stay large' as — 00, but it 
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quickly became clear that the more fundamental question of their norm conver- 
gence in L?'{p) posed an interesting challenge in its own right. After several im- 
portant partial results |l8l|l[M|28l[30lll6l[T9l|2il|3ri,the convergence of the hnear 
averages in general was settled by Tao in |(29l|. By contrast, our understanding of 
the polynomial case remains poor. 

In H we gave a new proof of convergence in the linear case, more classically er- 
godic theoretic than Tao's (which relies on a conversion of the problem into an 
equivalent quantitative assertion about the shift transformations on {'L/N'LY). In 
this paper and its sequel [2] we shall further develop the methods of |4| to pro- 
vide some more versatile machinery, and illustrate its use with a proof of norm 
convergence for the new polynomial instance 

1 ^ 

j^Y.^hoTf){hoTfT^). 

n=l 

Most analyses of such questions rest on the fundamental notion of 'characteristic 
factors' for a system of averages, first made explicit by Furstenberg and Weiss 
in their work lfT6l on a case of polynomial averages involving only only a single 
underlying transformation. Here a tuple of factors : {X,fi,T) — )• {Yi,Vi,Si), 
i = 1, 2, . . . , fc, of a Z'^-system will be termed characteristic for some tuple of 
polynomial mappings pi : TL — > if 

N d ^ N d 

n=l 1=1 n=l i=l 

for any /i, /2, ■ ■ ■ , /d € L°°{ij), where we write /at ~ to denote that H/tv — 
9n\\2 — )• as — ^ oo. 

In the case d = 1 (so we deal only with powers of a single transformation) the 
possible structures of such characteristic factors have been completely understood 
as those of pro-nilsystems, first in the case of linear averages |[2Tll32]| and then also 
for their higher-degree polynomial relatives |[20ll24l . 

However, for general d, even when each pi is a linear mapping the possible charac- 
teristic factors seem much more complicated; although it can be shown abstractly 
that minimal characteristic factors exist, they have so far largely resisted useful de- 
scription. The chief innovations of the papers [4 , 3] were methods for constructing 
an extension a system T : TH^ r\ {X,fi) to a larger system T : I? r\ {X,fl) 
(which depends on the linear mappings pi) in which a characteristic tuple of fac- 
tors for the convergence of these linear averages could be found with an especially 
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simple structure. Knowing this structure then enabled (together with an appeal to 
various existing machinery) new ergodic-theoretic proofs of first the convergence 
of such averages in L?'{ii) and then the Multidimensional Multiple Recurrence 
Theorem. 

Effectively, this approach shifts our viewpoint from the setting of an individual 
system of commuting transformations (and factors thereof) to the category of all 
such systems. While a given system can fail to exhibit among its own factors all of 
the structural features that can be exploited to examine nonconventional averages, 
these emerge upon passing to a sufficiently enriched extension of that system (or, 
equivalently, upon considering more general joinings of it to other systems of var- 
ious special kinds). We should note that it seems likely that the extensions with 
improved behaviour for our purposes seem to be highly arbitrary and will often be 
obtained after several steps that both involve some arbitrary choices and could be 
performed in different orders. In this sequence of papers we will generally refer to 
extensions that admit some simple characteristic factors for a given system of aver- 
ages as pleasant extensions of the original system, although the particular class of 
'simple' characteristic factors that we use will vary from one context to the next. 

In fact, the strategy of passing to an extension where the behaviour of noncon- 
ventional averages is more easily described also has a precedent in various earlier 
papers, notably the work 1.16] of Furstenberg and Weiss (we will return to the rela- 
tionship between their work and ours several times later). However, while in their 
work it can later be proved that the characteristic factors of the original system 
must have taken the same form as those obtained in the extension, in the multi- 
dimensional setting the passage to an extension leads to a genuine reduction in 
complexity of description of the characteristic factors, and recent works in this 
area have begun to exploit this idea much more extensively. (For example, Bernard 
Host 1 18] has given a new and rather more efficient construction of an extension 
with much the same desirable properties as those constructed in [4].) 

Here we develop further this approach to nonconventional averages. An undesir- 
able feature of the constructions of extensions in ||4l [3l [181 is that the lifted com- 
muting transformations % generally lose any algebraic structure that might have 
been known to hold a priori among the Tj. Two particular kinds of structure that 
can be of interest for applications are: 

• the existence of roots, such as some Si such that Tj = Sf; 

• linear relations that may hold among the original transformations Tj, such as 
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In this paper and its sequels we will begin to see what pleasant extensions can be 
found that retain such additional features. Concerning the existence of roots, in 
Section l42l we will find that one can recover essentially the same result as in [141 : 

Theorem 1.1 (Pleasant extensions of linearly independent linear averages). Any 

Tj'^ -system {X, fi, T) has an extension vr : {X, jl,T) ^ {X, /i, T) such that for any 
linearly independent pi, p2, . . . , pfe G f/ie averages 

N d 
n=l i=l 

admit a characteristic tuple of factors of the form 

where for a transformation S we write (q for some factor map generating, up to 
negligible sets, the a-algebra of sets left invariant by S. 

The point here is that, setting A := Zpi + • • • + Zp^ < Z'*, the older constructions 
give only an extension of the subaction T^^. Notice that we obtain a single ex- 
tended system {X,jl,T) that enjoys the above simplified characteristic factors for 
every tuple of linearly independent directions. We will refer to such an extension 
as a pleasant extension for linearly independent linear averages. 

More importantly than this result, this first paper will introduce various general 
ideas needed in preparation for the more sophisticated results of the sequel [2J, 
especially the notion of 'satedness' for probability-preserving systems (Subsection 
3.1 below) which will be relied on repeatedly in proving all the main results of that 
paper. 

In m we shall begin to address the second kind of algebraic structure listed above. 
We will examine one simple case in detail, but conjecture that our methods could 
eventually be extended to a much more general result. We will consider the case 
of three directions TP\ T^^ in a Z^-system that are in general position with 
the origin 0: that is, such that no three of the points pi, p2, pa and lie on a 
line. For the associated linear averages we will show how to construct an extended 
Z^-system in which the characteristic factors take a special form, which will give 
us a notion of pleasant extensions for triple linear averages subject to this kind of 
linear dependence. 

Of course, we do pay a price for insisting that the Z^-structure of the action be 
preserved, in that the characteristic factors we eventually obtain are not as simple 
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as the pure joins of isotropy factors that emerge in the linearly independent case. 
The additional ingredients we need are Z^-actions given by pairs of commuting 
rotations on two-step pro-nilmanifolds (or, to be precise, direct integrals of such 
actions). 

Theorem 1.2 (Pleasant extensions for linearly dependant triple linear averages). 

Any system T : 1? r\ {X,fj.) has an extension n : {X,fl,T) — )■ {X,fj,,T) such 
that for any pi, p2, ps G that are in general position with the origin the 
averages 

1 ^ 

n=l 

admit characteristic factors ^j, i = 1,2,3, of the form 

?j — <50 V C,o V C,o V (,nji 2> 

where 2 '■^ maximal factor of {X, ^u, T) generated by direct integrals of two- 
step nilsy stems (the precise meaning of this will be elaborated in [2]). 

From this we will also be able to deduce a pleasant-extensions result for certain 
double quadratic averages, following an application of the well-known van der 
Corput estimate. 

Theorem 1.3 (Pleasant extensions for some double quadratic averages). Any sys- 
tem of two commuting transformations Ti,T2 r\ (X, ^) has an extension vr : 
(X, /i, Ti , T2) — )• {X, fj,, Ti , T2) in which the averages 

1 ^ 

n=l 

admit characteristic factors of the form 

m>l 

This, in turn, gives us enough control over these quadratic averages to complete a 
proof of their norm convergence. 

Theorem 1.4. IfTi,T2:Zr\ (X, fi) commute then the averages 

1 ^ 

n=l 

converge in L'^ip-) as N ^ 00 for any /i, /2 G L°°{^). 
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Although this new convergence result is modest in itself — it is only one special 
case of the much more general conjecture of norm convergence for all multidimen- 
sional polynomial averages, which remains out of reach for the time being — we 
suspect that the methods developed in this paper and its sequel will ultimately have 
more far-reaching relevance to this question, and potentially to other questions on 
the structure of joinings between different classes of system in the ergodic theory 
of Z'^-actions. 

Acknowledgements My thanks go to Vitaly Bergelson, Zoltan Buczolich, Bernard 
Host, Bryna Kra, Mariusz Lemaiiczyk, Emmanuel Lesigne, Thierry de la Rue, Ter- 
ence Tao, Jean-Paul Thouvenot, Dave Witte Morris and Tamar Ziegler for several 
helpful discussions, and to the Mathematical Sciences Research Institute (Berke- 
ley) for its hospitality during the 2008 program on Ergodic Theory and Additive 
Combinatorics. 

2 Background definitions and general results 

2.1 Measurable functions and probability kernels 

We will work exclusively in the category of standard Borel probabiUty spaces 
{X, Sx, m)' so will often suppress mention of their a-algebras. Given stan- 
dard Borel spaces X, Y and Z, a completed Borel probability measure /i on X, 
a (T-subalgebra E C Ex and measurable functions (p : X ^ Y, ijj : X ^ Z, 
we wiU write that is /ii-vlrtually E!-measurable if there is some S-measurable 
map ipi : X ^ Z such that ip = ijji /U-almost everywhere, or similarly that ip is 
//-virtually a function of (j) if there is some measurable function 9 : Y ^ Z such 
that i/; = 9o(p; these two definitions are related by the usual correspondence (up to 
negligible sets) between cr-subalgebras and maps that generate them for standard 
Borel spaces. 

Factor maps from one probability space to another comprise the simplest class of 
morphisms between such spaces, but we will sometimes find ourselves handling 
also a weaker class of morphisms. Suppose that Y and X are standard Borel 
spaces. Then by a probability kernel from F to X we understand a function 
P : y X Ex ^ [0, 1] such that 

• the map y i-> P(y, A) is Ey-measurable for every A G Ex; 

• the map A P{y, A) is a probability measure on Ex for every y eY. 
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The first of the above conditions is then the natural sense in which the assignment 
y I— P{y, • ) of a probability distribution is measurable in y; indeed, a popular 
alternative definition of probability kernel is as a measurable function from Y to 
the set Pr X of Borel probability measures on X. In ergodic theory this notion is 
also often referred to as a 'quasifactor' : see, for example. Chapter 8 of Glasner 1 17 1, 
where this alternative convention and notation are used. We will write P -.Y ^ X 
when P is a probability kernel from Y to X. 

Given a kernel P : Y ^ X and a probability measure v on Y, we define the 
pushforward measure P^ on X by 

P#u{A) := j^P{y,A)u{dy)- 

this measure on X can be interpreted as the law of a member of X selected ran- 
domly by first selecting a member of Y with law v and then selecting a member 
of X with law P{y, ■ ). This extends standard deterministic notation: given a mea- 
surable function (/> : y — )• X, we may associate to it the deterministic probability 
kernel given by P{y, • ) = (the point mass at the image of y under 0), and 
now P^v is the usual push-forward measure 4>^y- 

Certain special probability kernels naturally serve as adjoints to probability-preserving 
maps, in the sense of the following theorem. 

Theorem 2.1. Suppose that Y and X are standard Borel spaces, that fi is a prob- 
ability measure on X and that (p : X ^ Y is a measurable map. Then, denoting 
the pushforward by v, there is a v -almost surely unique probability kernel 
P -.Y ^ X such that fi = P^u and which represents the conditional expectation 
with respect to (p: for any f G L^{lj), the function 

xi^ f{x)P{(l){xi),dx) 
Jx 

is a version of the ^-conditional expectation of f with respect to 0^^(Sy). 

We also write that this P represents the disintegration of fi over (j). A general prob- 
ability kernel P : Y ^ X represents the disintegration over (j) of some measure 
that pushes forward onto v if and only if P{x, ■ ) i^{dy) and P{y, ■ ) i^{dy) 
are mutually singular whenever Af] B = f/i. 

Proof See Theorem 6.3 in Kallenberg □ 
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2.2 Systems, subactions and factors 



In this paper we shall spend a great deal of time passing up and down from sys- 
tems to extensions or factors. Moreover, sometimes one system will appear as a 
factor of a 'larger' system in several different ways (most obviously, when we work 
with a system recovered under the different coordinate projections from some self- 
joining). For this reason the notational abuse of referring to one system as a factor 
of another but leaving the relevant factor map to the understanding of the reader, 
although popular and useful in modern ergodic theory, seems dangerous here, and 
we shall carefully avoid it. In its place we substitute the alternative abuse, slightly 
safer in our circumstances, of often referring only to the factor maps we use, and 
leaving either their domain or target systems to the reader's understanding. Let us 
first set up some notation to support this practice. 

If r is a l.c.s.c. group, by a F-system (or, if T is clear, just a system) we understand 
a coarsely-continuous probability-preserving action T iT {X, jj.) on a standard 
Borel probability space. We will often alternatively denote this space and action by 
(X, fi, T), or by a single boldface letter such as X. If A < F is a closed subgroup 
we denote by T^^ : A r\ (X, ^) the action defined by {T^^ f := for 7 G A, 
and refer to this as a subaction, and if X = {X, /x, T) is a F-system we write 
similarly X^^ for the system {X, fi, T^^) and refer to it as a subaction system. 

A factor from one system {X, /i, T) to another (Y, v, S) is a Borel map tt : X ^ Y 
such that TT^^ = V and vr o T''' = 5''' o vr for all 7 G F. Given such a factor, we 
sometimes write TIt^ to denote the action S with which T is intertwined by vr. 

Any factor vr : X — )• Y specifies a globally T-invariant cr-subalgebra of Sx in 
the form of 7r^^(Ily). Two factors vr and ip are equivalent if these cr-subalgebras 
of that they generate are equal up to ^-negligible sets, in which case we shall 
write vr ~ V; this clearly defines an equivalence relation among factors. 

It is a standard fact that in the category of standard Borel spaces equivalence classes 
of factors are in bijective correspondence with equivalence classes of globally in- 
variant (j-subalgebras under the relation of equality modulo negligible sets. A 
treatment of these classical issues may be found, for example, in Chapter 2 of 
Glasner ifTTl . Given a globally invariant cr-subaglebra in X, a choice of factor 
vr : X — )• Y generating that cj-subalgebra will sometimes be referred to as a coor- 
dinatization of the fi-subalgebra. Importantly for us, some choices of coordinatiz- 
ing factor vr may reveal some additional structure of the factors more clearly than 
others. For this reason, given one coordinatization vr : X — Y and an isomor- 
phism : Y — Y', we shall sometimes refer to the composition ^ o vr : X — Y' 
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as a recoordinatization of tt. 

More generally, the factor tt : {X,n,S) — )■ {Y,i',S) contains V' : iX,IJ',T) 
{Z, 6, R) if 7r~^(Sy) D iIj~^{'Ez) up to /x-negUgible sets. It is a classical fact that 
in the category of standard Borel spaces this inclusion is equivalent to the existence 
of a factorizing factor map : {Y, u, S) — > {Z, 9, R) with ip = (p o -k fi-a.s., 
and that a measurable analog of the Schroeder-Bemstein Theorem holds: tt if) 
if and only if a single such (f) may be chosen that is invertible away from some 
negligible subsets of the domain and target. If vr contains ijj we shall write tt ^ i/" 
or ^ ;^ TT. It is clear that (up to set- theoretic niceties) this defines a partial order 
on the class of ^-equivalence classes of factors of a given system. We will extend 
the above terminology to that of coordinatizations and recoordinatizations of 
families of factors of a system in terms of the appropriate commutative diagram of 
isomorphisms. 

Given a F-system X = {X,/j,,T), the cr-algebra of sets A € for which 
IJ,{AAT"' (A)) = for all 7 G T is T-invariant, so defines a factor of X. More 
generally, if F is Abelian and A < F is closed then we can consider the cr-algebra 
Sj''^ generated by all T^^-invariant sets: we refer to this as the A-isotropy factor 
and write Zq for some new system that we adopt as the target for a factor map 
Cq^'^ that generates and Z^^^ for the standard Borel space underlying Zq^ 

Note that in this case the Abelianness condition (or, more generally, the condition 
that A < F) is needed for this to be a globally T-invariant factor. If Ti and T2 are 
two commuting actions of the same group F on (X, /i) then we can define a third 
action TiT2^ by setting {TiT2^)'^ := T^T^ , and in this case we sometimes 
write Cj'=^2 . X Z|^i='^2 place of C(f'^' ' : X -> zj^'^^ \ similarly, if 
S* C F and A is the group generated by A, we will sometimes write Zj^^ for Zq 
and similarly. 

An important construction of new systems from old is that of relatively inde- 
pendent products. If Y = (F, S) is some fixed system and ttj : Xj = 
{Xi, iii,Ti) Y is an extension of it for i = 1,2, ... ,k then we define the 
relatively independent product of the systems Xj over their factor maps tt^ to be 
the system 

n = ( II Xi, (g) Aii,^! X T2 X ■ • • X Tk) 

{'JVi=TV2=---=TVk} {•7ri=7r2=...=7rfc} {7ri=7r2=...=7rfc} 
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where 



[[ Xi := {{xi,X2,...,Xk) € Xi X X2 X ■ ■ ■ X Xk : 

[tTI=1T2 = ...=1Tj^} 

7ri(xi) = -K2ix2) = ... = -Kk{Xk)}, 
{7ri=7r2 = ...=7rfe} ^ i=l 

P 

and Pi : Y ^ Xi is a probability kernel representing the disintegration of fXi over 
TTj. In case A; = 2 we will write this instead as Xi Xj^^^^^} ^2> and in addition if 
Xi = X2 = X and vri = 7r2 = tt then we will abbreviate this further to X x X, 
and similarly for the individual spaces and measures. 



2.3 Measurable selectors 

At several points in this paper we need to appeal to some basic results on the 
existence of measurable selectors, often as a means of making rigorous a selection 
of representatives of one or another kind of data above the ergodic components of 
a non-ergodic system. 

Theorem 2.2. Suppose that {X, T,x) and {Y, Sy) are standard Borel spaces, that 
A <^ X is Borel and that tt : X ^ Y is a Borel surjection. Then the image it {A) 
lies in the v'^ -completion ofTiyfar every Borel probability measure v on {Y^ Sy) 
with completion v^, and for any such u there is a map f : B ^ A with domain 
B G^Y such that B C tt{A), z^^(7r(^) \ B) = and tt o f = id^. □ 

Proof See, for example, 4230 and its consequence 424X(h) in Fremlin [131. D 

Definition 2.3 (Measurable selectors). We refer to a map f as given by the above 
theorem as a measurable selector for the set A. 

Remark We should stress that this is only one of several versions of the 'measur- 
able selector theorem', due variously to von Neumann, Jankow, Lusin and others. 
Note in particular that in some other versions a map / is sought that select points 
of A for strictly all points of tt{A). In the above generality we cannot guarantee 
that a strictly-everywhere selector / is Borel, but only that it is Souslin-analytic 
and hence universally measurable (of course, from this the above version follows 
at once). On the other hand, if the map tt\a is countable-to-one, then a version of 
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the result due to Lusin does guarantee a strictly-everywhere Borel selector /. This 
version has already played a significant role in our comer of ergodic theory in the 
manipulation of the Conze-Lesigne equations (see, for example, HI [16117^), and so 
we should be careful to distinguish it from the above. A thorough account of all 
these different results and their proofs can be found in Sections 423, 424 and 433 
of Fremlin US. < 

In the right circumstances it is possible to strengthen Theorem |2.2| to obtain a Borel 
selector that is invariant under a group of transformations, by making use of a 
coordinatization of the invariant factor. 

Proposition 2.4. Suppose that {X, T,x) and (Y, Sy) are standard Borel spaces, 
A X is Borel and tt : X ^ Y is a surjective Borel map, and in addition 
that T : T r\ (X, Sx) is a jointly measurable action of a l.c.s.c. group such 
that TT is a factor map, so it o T"^ = S'^ o n for some jointly measurable action 
S :T r\ (y, Sy), and that A is T -invariant. Then for any S -invariant probability 
measure u on {Y, Sy) with completion v'^ there are an S-invariant set B G Sy 
such that B C t:{A) and u'^{'k{A) \ B) = and an S-invariant map f : B ^ A 
such that IT o f = ids- 

Proof Let /q : Bq ^ Ahe an ordinary measurable selector as given by Theo- 
rem l2.2[ and let u be any S'-invariant probability measure on {Y, Sy ). This Bq must 
be z^-almost 5-invariant, simply because vr(j4) is T-invariant and v'^{BAtt{A)) = 
0. Using local compactness and second countability, let (Fj)j>i be a countable 
compact cover of F, and also let mr be a left-invariant Haar measure on F. From 
the joint measurability of T it follows that the set 

B:={yeY: mr{7 G T : S^y) Y \ Bq} = O} 

= f^{yeY: mr{7 G : S"'{y) G ^o} = mr(F,)} 

i>l 

is Borel, T-invariant and satisfies ^{BqAB) = 0. 

We now let : (Y, Sy , u) — )■ (Z, Sz, ^) be any coordinatization of the invariant 
factor Sy ; it is easy to see that this may be chosen so that there exists some C G Hz 
such that B = (^^{C). We can now use B and C to 'tidy up' our original selector 
/o. Indeed, by the S-invariance of ( and the fact that for every y £ B we have 
S'^{y) G Bq for some (indeed, almost all) 7 G F, we must have C(^o) 5 C- 
Therefore by applying the ordinary Measurable Selector Theorem a second time 
we can find a Borel subset D £ T.z with D C C and 9{C \ D) = and a Borel 
section -q : D ^ Bq such that C ° ">] = ids ; and so now replacing Bq with B and 
the map /o with f : y ^ /o(??(C(y))) completes the proof. □ 
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Definition 2.5 (Invariant measurable selectors). We refer to a map f as given by 
the above proposition as a T-invariant measurable selector for the set A. 

3 Idempotent classes of systems 
3.1 Basic properties of idempotent classes 

Central to this paper will be a systematic exploitation of a property of certain sys- 
tems according to which they can be joined to certain other classes of system only 
in simple ways. This key definition, although very abstract and very simple, will 
repeatedly prove surprisingly powerful. We will introduce it after some other pre- 
Uminaries about classes of F-systems. 

Lemma 3.1. Suppose that C is a class of T -systems (formally, C is a subcategory 
ofT-Sys) that contains the trivial system and is closed under isomorphisms, arbi- 
trary joinings and inverse limits. Then any T-system X has an essentially unique 
maximal factor in the class C. 

Proof It is clear that under the above assumption the class 

{S C Ex : S is a T-invariant cr-subalgebra such that the associated factor is in C} 

is nonempty (it contains {0, X}), upwards directed (because C is closed under join- 
ings) and closed under taking a-algebra completions of increasing unions (because 
C is closed under inverse Umits). There is therefore a maximal cr-subalgebra in this 
set. □ 

Definition 3.2 (Idempotence). A class of systems C is idempotent if it contains the 
trivial system and is closed under isomorphisms, joinings and inverse limits. In 
this case, we will write that X is a Q-system ifK is a system in the class C, and for 
arbitrary X we write : X — > CX.for an arbitrarily-chosen coordinatization of 
its maximal C-factor given by the above lemma. 

It is clear that ?/ tt : X — > Y then Cc^ ^ Cc^ o tt, and so there is an essentially 
unique factorizing map, which we denote by Ctt, that makes the following diagram 
commute: 
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In addition, we shall abbreviate X x^x X to X Xc X, and similarly for the indi- 
vidual spaces and measures defining these systems. 

The reason for this terminology lies in the observation that the assignment X i-> 
CX defines an autofunctor of the category F-Sys, and the assignment X i-> 

defines a natural transformation from the identity functor to this autofunctor We 
can work with such functors quite generally, and a simple definition-chase shows 
that a functor F : F-Sys — > F-Sys together with a natural transformation idp.gyg 

F correspond to a class of systems C as above if and only if F is idempotent (that 
is, F(FX) = FX for every X) and the natural transformation is the identity on the 
subcategory of systems of the form FX. Indeed, it would be possible to develop 
the theory of the coming sections by working entirely with autofunctors rather than 
classes, but I do not know of any examples of autofunctors that are useful as such 
but do not arise from idempotent classes as above. 

The name we give for our next definition is also motivated by this relationship with 
functors. 

Definition 3.3 (Order continuity). A class of T -systems C is order continuous if 

whenever (X(m))m>o. (V'(fc)^)m>jfc>o is an inverse sequence ofV-systems with in- 
verse limit X, (V'(m))m>o 'we have 

Cl = V C?'"' o : 
m>0 

that is, the maximal C-f actor of the inverse limit is simply given by the (increasing) 
join of the maximal C-factors of the contributing systems. 



Examples The following idempotent classes will be of particular importance (in 
all cases idempotence is routine to check): 

1 . Given a fixed normal subgroup A < F, Zq denotes the class of systems for 
which the A-subaction is trivial. 
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2. More generally, for A as above and any n G N we let Z„ denote the class 
of systems on which the A-subaction is a distal tower of height at most n, in 
the sense of direct integrals of compact homogeneous space data introduced 
in UJ to allow for the case of non-ergodic systems. 

3. We can modify the previous example by placing some additional restrictions 
on the permissible distal towers: for example, Z^j^ ^ comprises the systems 
with A-subaction a distal tower of height at most n and in which each isomet- 
ric extension is Abehan. We will meet other, even more restricted idempotent 
classes contained within this one later. 

(We note in passing that an example of a natural class that is not closed under 
joinings when F = Z is the class WM"*" of systems that are system-disjoint from all 
weakly-mixing systems, as has been proved by Lemariczyk and Parreau in B251 .) < 

Example Although all the idempotent classes that will matter to us in this paper 
can be shown to be order continuous, it may be instructive to exhibit one that is 
not. Let us say that a system X has a finite-dimensional Kronecker factor if 
its Kronecker factor : X ^ can be coordinatized as a direct integral 
(see Section 3 of [1]) of rotations on some measurably- varying compact Abelian 
groups all of which can be isomorphically embedded into a fibre repository for 
some fixed D £ N (this includes the possibility that the Kronecker factor is finite or 
trivial). It is now easy to check that the class of Z-systems comprising all those that 
are either themselves finite-dimensional Kronecker systems, or have a Kronecker 
factor that is not finite-dimensional (so we exclude just those systems that have a 
finite-dimensional Kronecker factor but properly contain it), is idempotent but not 
order continuous, since any infinite-dimensional separable group rotation can be 
identified with an inverse limit of finite-dimensional group rotations. < 

Definition 3.4 (Hereditariness). An idempotent class C is hereditary if it is also 
closed under taking factors. 

Example Examples 1 , 2 and 3 in the first list above are hereditary. In the case of 
Zq this is obvious; for the higher distal classes Z^ or their Abelian subclasses it is 
an easy consequence of the Relative Factor Structure Fheorem 6.4 of |T| (applied 
to the relatively independent self-joining of the n-step distal system in question 
over the factor we wish to analyze). On the other hand, the separate example above 
involving the dimensionality of the Kronecker factors is clearly not hereditary. < 

Definition 3.5 (Join). IfQi, C2 are idempotent classes, then the class Ci V C2 of 
all joinings of members of Ci and C2 is clearly also idempotent. We call Ci V C2 
the join of Ci and C2. 
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Lemma 3.6 (Join preserves order continuity). If Ci and C2 are both order contin- 
uous then so is Ci V C2. 

Proof Let (X(^))m>o> (V'(fc)^)m>A,>o be an inverse sequence with inverse limit 
(''A(m))m>o- Then CqvC2 maximal factor of X that is a joining of a Ci- 
factor and a C2-factor (so, in particular, it must be generated by its own Ci- and 
C2-factors), and hence it is equivalent to C,^^ . Therefore any / G (^u) that is 
V -measurable can be approximated by some function of the finite-sum form 
YlipdpA ' 9p,2 with each (7^ j € L'^ip) being Cj-measurable, and now since each 
Cj is order continuous we may further approximate each gp^i by some /ip j o ip^jj^-^ 
for a large integer m and some Cj-measurable hp^i € L°°(/i(m)). Combining these 
approximations completes the proof. □ 

Examples Of course, we can form the joins of any of our earlier examples of 
idempotent classes: for example, given T = 1? and Pi, P2, Ps G we can form 
Zq^ V Zq^ V Zq^ This particular example and several others like it will 
appear frequently throughout the rest of this paper. We will remark shortly that 
joins of hereditary idempotent classes need not be hereditary. < 

The following terminology will also prove useful. 

Definition 3.7 (Joining to an idempotent class; adjoining). If X. is a system and C 
is an idempotent class then a joining ofJ^to C or a C-adjoining of X is a joining 
ofX and ^ for some Y G C. 

Definition 3.8 (Subjoining). Given idempotent classes Ci, C2, . . . , C^, a system X 
is a subjoining o/Ci, C2, . . . , C^ if it is a factor of a member o/Ci V C2 V • • • V C^. 

Finally we have reached the key definition that will drive much of the rest of this 
paper. 

Definition 3.9 (Sated system). Given an idempotent class C, a system X is C-sated 
if whenever tt : X = {X, /i, T) — )• X is an extension, the factor maps vr and C,q' on 
X are relatively independent over o tt = Cvr o under jl. 

An inverse sequence is C-sated if it has a cofinal subsequence all of whose systems 
are C-sated. 

Remark This definition has an important precedent in Furstenberg and Weiss' 
notion of a 'pair homomorphism' between extensions elaborated in Section 8 of [i l6i 
Here we shall make much more extensive use of this basic idea. <l 



16 



Lemma 3.10. IfC is an idempotent class, (X, T) is C-sated, {Y, v,S) G C and 
A is a (T X S)-invariant (/i, v)-joining then ^(j^^^'-'*'^^'^) ^ .^^^ 

Proof Let tti , 7r2 : {Xx Y, A) — {X, fi) , (Y, v) be the first and second coordinate 
projections respectively. The relation q^^^^^'"^^^) >- ^C^'M-T") ^ -^^ clear. On 
the other hand, the C-satedness of {X, /i, T) applied to the factor map tti implies 
that vTi is relatively independent from ^^^^^'^'^^•5') ^ygj. ^C^.m.T") under A, and 
this implies the reverse containment. This completes the proof. □ 

The crucial technical fact that turns satedness into a useful tool is the ability to 
construct sated extensions of arbitrary systems. This can be seen as a natural ab- 
straction from Proposition 4.6 of [4] and Corollary [3|. 

Theorem 3.11 (Idempotent classes admit multiply sated extensions). If {Ci)i^i is 

a countable family of idempotent classes then any system Xq admits an extension 
TT : X — )• Xq such that 

• X is Ci-sated for every i S I; 

• the factors vr and Co S^^^^^te the whole ofK. 

We shall prove this result after a preliminary lemma. 

Lemma 3.12. If C is an idempotent class then the inverse limit of any C-sated 
inverse sequence is C-sated. 

Proof By passing to a subsequence if necessary, it suffices to suppose that (X(^) ), 

{'ip^^^)m>k>o is an inverse sequence of C-sated systems with inverse limit X^o^^), 

{ip(m))m>i, and let vr : X — > X(oo') be any further extension and / G L°°{n(^ao))- 
We will commit the abuse of identifying such a function with its lift to any given 
extension when the extension in question is obvious. With this in mind, we need to 
show that 

E(/|Cf) = E(/|C?°°'). 
However, by the C-satedness of each X(^) , we certainly have 

e(E(/|Vm)IC?) = e(/|C?-'), 

and now as m — oo this equation converges in L^{fJ-) to 

E(/|C?) = E(/| lim(C?<'"^oV'(^))). 
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By monotonicity we must have 



and so by sandwiching we must also have the equality of conditional expectations 
desired. □ 

Proof of Theorem 13.111 We first prove this for / a singleton, and then in the 
general case. 

Step 1 Suppose that / = {i} and Cj = C. This case will follow from a simple 
'energy increment' argument. 

Let {fr)r>i be a countable subset of the L°°-unit ball {/ e : ||/||oo < 1} 

that is dense in this ball for the L^-norm, and let (rj)j>i be a member of in 
which every non-negative integer appears infinitely often. 

We will now construct an inverse sequence (X(„))m>o. (V'(fc)'*)m>fc>o starting 
from X(o) := Xq such that each X(^_|_i) is a C-adjoining of Xj-^^). Suppose 

that for some m-i > we have already obtained (^(m))Zto, (^(^^)mi>m>fc>o 
such that idj!s:(„^) — Cc '""^^ V V'l^ ■ We consider two separate cases: 



• If there is some further extension vr : X — X(„^) such that 

ww^i ° ° - 1 cf )ii2 > m,,^jfr^i o I c?-^)iii + 2--, 

then choose a particular vr : X — ^(mi) such that the increase 

WW-^i ° V'foT^ ° - 1 cf )ll2 - IIE.(.,)(/.., o v^f-) I c?<-))||2 

is at least half its supremal possible value over all extensions. By restricting 
to the possibly smaller subextension of X — )• X^^^) generated by vr and 
we may assume that X is itself a C-adjoining of ^(mi) hence of Xq, and 

now we let X(m^+i) := X and ■= (the other connecting factor 

maps being now determined by this one). 

• If, on the other hand, for every further extension vr : X — ^(^mi) we have 
then we simply set X(„^+i) := X(„^) and vf™')"*"^^ := 
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Finally, let X(oo), (V'(m))m>o be the inverse limit of this sequence. We have 

rri>0 m>0 

m>0 m>0 

SO X^oQ-) is still a C-adjoining of Xq. To show that it is C-sated, let tt : X — > X^^q) 
be any further extension, and suppose that / G L°°(//((^)). We will complete the 
proof for Step 1 by showing that 

EA(/ovr|cf) = E^(^,(/|C?-^)o7r. 

Since X((^) is a C-adjoining of X, this / may be approximated arbitrarily well in 

by finite sums of the form gp ■ hp with gp being bounded and Q 
measurable and hp being bounded and ^/'(o)"™s^surable, and now by density we 
may also restrict to using hp that are each a scalar multiple of some frp o ^^^q^, so 
by continuity and multiUnearity it suffices to prove the above equality for one such 

product g-{fr° V'(o))- Since g is Cc -measurable, this requirement now reduces 
to 

E/l(/r O V'(O) °T^\Cc) = ^1^(00) ifr ° V'(O) I Cc ) ° ^- 

Since ^ o tt, this will follow if we only show that 

||EA(/r o V'(O) o vr I ChWl = l|EM(^)(/r- o V'(O) I Cc'^')\\l 

Now, by the martingale convergence theorem we have 

l|E^(.)(/.o^[o? K?^')ll2 t l|E;.(^)(/.oV;(o) |C?-')||i 
as m — > 00. It follows that if 

WW' ° V» O TT I C?)||i > ||E^(^, {fr O V^(o) I C?^°°^)||i 

then for some sufficiently large m we would have Vj^ = r (since each integer 
appears infinitely often as some r^) but also 

II E.(.,, o ^[-+^) I C?""-) ) Hi - II E,^^^ ifr o I C?-' ) Hi 

< II E,^^, ifr o ,/;(o) I C?'-' ) Hi - II E,(^^ (/. o I Cj'-^ ) Hi 

< I (11 EM/. ° A^oT^ o n I C?)||i - II E,^^^ (/ o ^J™) I Cj<™))||i) 
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and 



l|E,(/. o V^fo^) o vr I Cf )||2 > ||E,^„, (/ o V^f™) I C?(-')||2 + 2-, 

so contradicting our choice of X(^_|_i) — )• X(„) in the first alternative in our con- 
struction above. This contradiction shows that we must actually have the equality 
of -norms asserted above, as required. 

Step 2 The general case follows easily from Step 1 and a second inverse limit 
construction: choose a sequence {im)m>i S in which each member of / ap- 
pears infinitely often, and form an inverse sequence (X(m))m>o> (V'(fc)'')m>fc>o 
starting from X(o) •= such that each (X(,„)) is Cj^-sated for m > I. The 
inverse limit X is now sated for every Cj, by Lemma [3.12l □ 

Remark Thierry de la Rue has shown me another proof of Theorem |3.11| that fol- 
lows very quickly from ideas contained in his paper ||271 with Lesigne and Rittaud, 
and which has now received a nice separate writeup in ifTTl . The key observation 
is that 

An idempotent class C if hereditary if and only if every system is C-sated. 

This in turn follows from a striking result of Lemariczyk, Parreau and Thouvenot |[26ll 
that if two systems X and Y are not disjoint then X shares a nontrivial factor with 
the infinite Cartesian power Y^°°. Given now an idempotent class C and a system 
X, let C* be the hereditary idempotent class of all factors of members of C, and let 
Y be any C-system admitting a factor map vr : Y — )• C*X (such exists because by 
definition C*X if a factor of some C-system). Now forming X := X x j^^x Y, a 

quick check using the above fact shows that CX = C*X, and that this is equivalent 
to the C-satedness of X. 

As remarked previously, a routine argument shows that our basic examples Zq , 
and Z^j^ ^ are all hereditary, and hence that any system is sated with respect to 
any of them. However, joins of several hereditary idempotent classes need not be 
hereditary. For example, let X = (T^, Haar, R{a,o)j R{o,a))' where denotes a 
rotation by /? G and a G (M \ Q)/Z, and let ei , e2 be the standard basis vectors 
of Z^. Then clearly Cj''' = T^s-i, the projection onto the (3 — i)^^ coordinate, and 
so Cq""^ V Co"^^ ~ idx and therefore X is itself a system in the class Zq^ V Zq^. 
However, Zq^^'^^X is the factor generated by the SW-NE diagonal circles in T^, 
so Z^i+'^^x is a factor of a system of class Z^^ V Z^^ but Z^^ V Z^^ (z^i+'^^x) 
is the trivial system, so Zq^'''''^X is not (Zq^ V ZQ^)-sated. (Nevertheless, some 
preliminary results in Section 7 of fT] indicate that such counterexamples must be 
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rather special, and we suspect that the machinery of that paper and the present one 
may have more to say on this question in the future.) < 



It will serve us well to adopt a special name for a particular case of the above 
multiple satedness that will recur frequently. 

Definition 3.13 (Full isotropy-satedness). A system T : r\ (X, jj) is fully 
isotropy-sated (FIS) if whenever pi : ^ U^, i = 1,2, ... ,k, are isomorphic 
embeddings then the system {X, ji, T) is {TJq V Zq^ V • • • V Z^'')-sated. 

Corollary 3.14. Any IJ^ -system admits an FIS extension. □ 

3.2 Subactions and insensidvity of idempotent classes 

Given an l.c.s.c. group and closed subgroup A < F there is an obvious forgetful 
functor F-Sys — > A-Sys, X i-> Xl^^. Some idempotent classes C in F-Sys actually 
make sense in both categories, in the sense that X G C if and only if Xl^"^ € Co 
for some idempotent class Co of A-systems. Loosely, these are the classes that 
are defined in terms of properties depending only on the subaction X^^, the most 
obvious example being Zq . Importantly, in this case forming the maximal Zq- 
factor of a F-system X and of its subaction system Xl^^ give measure-theoretically 
the same factor space. This is a simple but important phenomenon that we will need 
to appeal to later (although we shall sometimes suppress the distinction between a 
class C of F-systems and the corresponding class C'^^ of A-systems). 

Definition 3.15 (Insensitivity of idempotent classes). An idempotent class C of K- 
systems is insensitive to the forgetful functor to k-subactions if whenever X is a 
V -system, the factor map C,q : X — > CX actually intertwines the whole T -action 
T with some V-action on Q.X (equivalently, if the a-subalgebra {(q- )~^(Scx) is 
globally T-invariant). 

In this case we can naturally extend the definition of the class C to the category F- 
Sys by letting CX be the T -action with which T is intertwined by Cc"^ ^ —'■ Cc'' 
we will generally use the same letter Cfor the idempotent class in either category. 

In general, an idempotent class C ofV-systems is insensitive to the forgetful func- 
tor to K-subactions if it is the extension to F-Sys of an insensitive idempotent class 
of A-systems. 

This notion of insensitivity will be most important to us in view of its consequences 
for satedness. In order to understand these, however, we must first introduce an 
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important method for building system extensions with desirable properties. For 
this we restrict to the setting of countable discrete Abelian groups. 

Suppose that F is a countable discrete Abelian group and A < F a subgroup, 
that X is a F-system and that ^ : X' = (X',//',S") — )• X^^ is an extension of 
its A-subaction system. It can easily happen that there does not exist an action 
T' : T {X' , fi') such that {T'Y^ = S'. However, if we permit ourselves to 
pass to further system extensions we can retrieve this situation, and this will be 
crucial in cases where our analysis of characteristic factors in the first place gives 
information only about a sublattice of Z"', rather than the whole group. 

Here we will introduce a particular construction of such a further extension (al- 
though in general we will use only the abstract fact of the existence of such an 
extension). We first need a basic result and definition from elementary group the- 
ory. 

Definition 3.16 (Remainder map). Tf A < F are as above and $7 C F a fun- 
damental region for the subgroup A ( that is, 0, is a subset containing exactly one 
member of every coset in F/A), then there is a remainder map F — )• : 7 i— )■ i?(7) 
with the property that 7 — -^(7) G A for all 7 G F. 

Now suppose that : X' — )• X^^ is as above. We will give a construction of a 
further extension of X' based on a similar idea to that underlying the construction 
of induced group representations. 

Let C F be a fundamental region as above with associated remainder map R, 
chosen so that e G $7 where e is the identity of G. This defines uniquely an 
'integer part' map F — )• A : 7 1— )• [7J := 7 — -^(7), and it follows at once 
that i?(7 + 7') = + R{i)) = R{R{j) + R{j')). 

By Rokhlin's Skew-Product Representation (see, for example. Section 3.3 of Glas- 
ner U/TJ) the extension ^ of A-systems can be described by a decomposition of X 
as Ui<n<oo 1^*^° r'^-invariant sets and, letting (y„, f„) be {1,2,..., n} with 
uniform measure for 1 < n < 00 and (1^00 , z^oo) be [0, 1) with Lebesgue measure, 
for each n a Borel cocycle : A x yl„ — )• Aut(y„, J^n), so that 

(5')^(x,y) = (r^x,<I>„(7,x)(y)) 

for {x, y) G An x y„ for all 7 G A. We adopt the simplified notations <I>(7, x) := 
$„(7, x) and {Yx, Vx) '■= {Yn, ^n) when x G An- For brevity we will simply write 
X' = Xv. Y,. 

We base our construction of A" on a specification of new, enlarged fibres above 
each point x G A. For 2; G A let := Htjen ^T-^x' define A := A x Y,. 
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Because Q is countable this product can be given the structure of a standard Borel 
space in the obvious way. 

We define the F-action T on X by its Rokhlin representation. Writing a typical 
element of as {yoj)uj, we set 

f'^{x,iyM := [T^x, ($(Li?(a;-7)+7j,r-^^"-^)x)(y^,(^_,)))J 

(noting that this is well-defined: if {yu))u £ Dtjen ^T-"x' then y^j G ^t-^x for 
all a; G and so yR(oj-'y) £ Yrp-R(co-y)^ = Yt-c^^tix)' because the assignment 
X i-> is A-invariant). 

Using our simple identities for R we can compute that 

r'^{r'^x,iy^U 

= {T^^x, {^mu - 72) + 72j,T-^(-^^)a;)(2/«(,_,,))) J 

miRiio - 71) + 71 J , T-^('^-^i)t^2 3.) 

{^{lR{R{u; - 71) - 72) + 72J , r-«W-^i)-^^)x)(to(«(,_^,)_^,)))) J 

($(Li?(a; - 71) + 7iJ,T-^^"~^i)r^^x) 

miRiu - (71 +72)) + 72j,T-«('^-(^^+^^))x)(y«(,_(,,+^,))))) J, 

and now we note that 

rp[R{oj-{'n+'Y2))+'Y2}j^-R{cj-{7l+'y2))^ _ 2iL'*'~{7l +72)4-72] -L'*'-{7l+72)J-ii(w-(7i +72))^ 

_ j>L'*'-7lJ-(<*'-(7l+72))2. 

_ 2i(a)-7i)-ii(a)-7i)-(a)-(7i+72))^ 

_ j.--R(a)-7i)j>72^ 

and so the cocycle equation for $ gives 

^i[Riu-^i) + ji\,T-'^^'^-^'^T^^x) o - (71 + 72)) + 72J , T-^^^^-^^^+^^^^x) 

= <^>{[R{u - 71) + 7iJ + [Riu^ - (71 + 72)) + 72J ,T-^("-(^i+T2))x) 
= ^(-L'^ - 7iJ + L'^ - 7iJ -[^- (71 +72)J,T-^('^-(^i+^^))x). 

Inserting this into the above formula for T">'i (f"''^ {x, {yi^)^)) shows that it is equal 
to f'71+72 {yuj)uj), and hence that T is a T-action. 
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Finally, if we let 



TT : {x, {ycu)u) ^ X and a: {x, (y^)ui) ^ {x, ye) 
(recalling that e £ Q) then it is routine to check that 

7r(r^(x, {y^)^)) = T^x = r%(x, {y^)^) V7 € T 

and 

a(rT(x, (y^)^)) = a{T^x, ($(Li?(^ - 7) + tJ , T-«('^-^)x)(y^(^_^))) J 
= (T^ci>(Li?(-7)+7j,r-^(-^)x)(y^(^,))) = {T\^^,x){ye)) = (5')^(x,ye) 

when 7 G A, so we have the required commutative diagram 

xrA ^ ^x^A 




Definition 3.17 (Fibrewise power extension). We will refer to the particular ex- 
tension vr : X — X constructed above as the fibrewise power extension ( or FP 
extension) ofS. corresponding to the subgroup A < F and system extension ^. 

Remark Interestingly, the appeal made to the discreteness of F/ A in the above 
proof seems to be quite important. While other instances of this theorem are cer- 
tainly available, it seems to be difficult to prove a comparably general statement for 
an inclusion A < F of arbitrary locally compact second countable Abelian groups; 
it would be interesting to know whether some alternative construction could be 
found to handle that setting. It is also worth remarking that there are certainly 
pairs of locally compact non-Abelian groups for which the conclusion fails: for 
example, by the Howe-Moore Theorem (see, for instance. Section 3.3 of [12]) any 
ergodic action of a non-compact connected simple Lie group G with finite cen- 
tre is mixing, and so any non-mixing ergodic action of R cannot be extended to a 
larger M-system in which the action can be enlarged to the whole group G for any 
embedding M G as a one-parameter subgroup. < 

We can now quickly derive some consequences for satedness. 

Lemma 3.18. IfK < F are as above and C is an idempotent class that is insensitive 
to the forgetful functor to A-actions then a T -system X is C-sated if and only ifK^^ 
is C-sated. 
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Proof It is clear that if X admits a C-adjoining that is not relatively independent 
over CX, then simply applying the forgetful functor gives the same phenomenon 
among the A-subactions: thus the C-satedness of X^^ implies that of C. The re- 
verse direction follows similarly, except that a given C-adjoining of X^^ may need 
to be extended further (for example, to an FP extension) to recover an action of the 
whole of r, and this extension of X will then witness that it is not sated. □ 

Corollary 3.19. For any subgroups Ai , A2 , . . . , Ajt < Z*^ a system X is ( Vi<fc ^0 ' 
sated if and only j/xK^i+^2+-+^fc) is ( y.^^ Z^')-sated. □ 



4 Some applications to characteristic factors 
4.1 The Furstenberg self-joining 

Consider a Z'^-system X = {X, ji, T). As have many previous works in this area, 
our analysis of characteristic factors associated to different sequences of linear 
averages will make heavy use of a particular class of self-joinings of X. Given k 
directions pi, P2, • • • , Pifc € Z'', let us here write 

N k 

SN{fi,f2,...jk) ^=]^En/^°^"''' 

n=l i=l 

for the associated A;-fold linear nonconventional averages. Now for Ai, A2, . . . , 
€ we can define 

//TPi,TP2,...,TP/i(^i X X • • • X Afc) := lim / ^atIUi, U2, • • • , Uj d//, 

and now it is routine to show (recalling that X is standard Borel) that this extends 
by linearity and continuity to a fc-fold self -joining of on X^, which is invariant 
under not only the Z'^-action T^'' but also the 'diagonal transformation' T := 
j-pi X X • • • X TPfc. This is the Furstenberg self -joining of /x associated to the 
transformations T^^, T^'^,. . . , T^'<=, and will be denoted by /xf.pi j^p^ or /x^, 
or sometimes abbreviated to /jF. Clearly whenever /j G L°°{n) for i = 1,2, . . . ,k 
we have also 

/ /i /2 ■ • • (8) /fe d/XrPi,TP2,...,rPfc = Jim / SNifi,f2, ■■■,fk) d/x. 

Jxk ' ' ' N-^ooJx 

Of course, the existence of the above limit follows from the known convergence 
of linear nonconventional averages. (Once convergence of the relevant polynomial 
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nonconventional averages has been established, a similar definition can be made 
corresponding to such polynomial averages, but in the nonlinear case these have 
yet to prove similarly useful.) 

In fact, in H the convergence of the above values is proved alongside the conver- 
gence of the nonconventional averages as part of a zigzag induction (one claim for 
a given k implies the other for that k, which then implies the first for /c + 1, and 
so on). This is possible in view of a reduction of the above limits to the study of 
linear averages involving only k — 1 transformations, which will also be important 
for us here: simply because /i is T-invariant, and working now in terms of bounded 
functions rather than sets, we can re-write the above limit as 

» ^ N k 

•^^ n=li=2 

Knowing that the Furstenberg self-joining /i|,p^ ^pj ^p^. exists, a basic applica- 
tion of the Hilbert space version of the classical van der Corput estimate gives us 
a way to use it to control the asymptotic behaviour of S^ifi, f2, ■ ■ ■ , fk), in the 
sense of the following estimate taken from Lemma 4.7 of 0. (We have modified 
the statement to give expUcit bounds, but the proof is unchanged.) 

Lemma 4.1. If fi, f2, ■ ■ ■ > fk S L°°(fi) are 1-bounded and e > Q is such that 

lim ||S^(/i,/2,...,/d)||2 >e, 

then there is a 1-bounded T -invariant function g G L°°{fi^) such that 

[ ]l{f^o7^,).gd^l}\>e\ 

□ 

This now implies a useful sufficient condition for characteristicity of a tuple of 
factors. 

Corollary 4.2. A tuple of factors : X — t- Yj, i = 1,2, ... ,k, is characteristic for 
the averages Sn if for any choice of fi G (/u), i = 1, 2, . . . , fc, and T-invariant 
g G (/i?.) we have 
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Proof Suppose that /j G (ji) for each i = 1, 2, . . . , /c. We need to show that 



/2, . . . , A) ~ SNiE^ih I 6), E^(/2 I 6), • • • , E^(/fc I 6)) 

as — )• oo, but by replacing each function with its conditional expectation in turn 
it clearly suffices to show that 

SnU'i^ /2, • • • , fk) ~ 'S'Af(E/i(/l I 6)) /2, • • • , fk)- 

This, in turn, is equivalent to 

57v(/l-E^(/l|6),/2,...,/fc)^0, 

and this now follows from the assumption and Lemma 14.11 because for any T- 
invariant g G L°° (/i^) we have 

= [ E^(/iie^)-n(/io^,).5d4 

d 

E^{h\ii))-\{{hoTTi)-gdixl. 

i=2 

□ 

Remark An alternative to the Furstenberg self -joining that can sometimes be put 
to similar uses has been constructed by Host and Kra, first in the case of powers of 
a single transformation in flT] and then for several commuting transformations by 
Host in [18|. This is defined in terms of a tower of iterated relatively independent 
self -products, and so has the advantage over the Furstenberg self -joining that it 
does not require an appeal to a previously-known nonconventional convergence 
result for its definition. Although we shall focus on the Furstenberg self-joining 
here for consistency, I suspect that the present paper and its sequel could be re- 
worked to use a Host-Kra self-joining throughout, and that neither presentation 
would be substantially easier. (In early drafts of these papers, the preference for 
the Furstenberg self-joining was dictated by a particular appeal to it in the last 
stages of proving Theorem 11.41 but subsequent improvements to that proof have 
made these unnecessary.) <l 



and so 
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It is easy to see that a the tuple of factors (Ci, ^2, • • • , Cd) is characteristic for the 
averages -Sat if and only if each of the d tuples 

(^i,idx,idx, • • • ,idx), 
(idx,6,idx,---,idx), 

(idx,idx,idx, ■■■,Cd) 

is characteristic for them. A slightly more subtle property that we will find useful 
later is the following. 

Lemma 4.3. For any factor ^ : X. ^ Y the tuple idx, • • • , idx) is character- 
istic for the nonconventional averages 

N k 
n=l 1=1 

if and only if the tuple (idx, idx, • • • , idx) is characteristic for the nonconven- 
tional averages 

1 ^ 

-j;(/oor-"PO- n /ior"(P'-P^) /o,/i,...,/,_i,/,+i,...,/feGL-(/x) 

n=l 

for every j = 2,3,... ,k. 

Proof This follows from a similar re-arrangement to those we have already seen 
above. By symmetry it suffices to treat only one of the needed implications, so let 
us suppose that idx, • ■ ■ , idx) is characteristic for 

^ AT k 

n=l i=l 

and show that (idx, ^, idx, • • • , idx) is characteristic for 

1 ^ 

- j;(/oor-"P2). n /ioT"(P'-P2) foJij3,...JkeL'^{p). 

n=l i<k,i^2 

Replacing /i by /i — E^(/i | ^i), it will suffice to show that if the latter averages do 
not tend to zero in for some choice of fo, f 3, fk then also the former do 
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not tend to zero for some choice of /2, /s, . . . , /a,.. Thus, suppose that fo, fa, ■ ■ ■ , fk 
are such that the latter averages do not tend to zero, and now let 



1 ^ 

N^oo iV ^ — ' 

n=l i<k,i^2 

The condition that /2 7^ and a change of variables now give 

N 



•' ^ n=l i<k^i^2 

N 

= J™ ^E / /2-(/oor-"P2)- n /.°2-"^P'-p^^d/i 

Af->oo iV ^ — ' / V 

n=l''^ i<k,i=/=2 

N 

n=l''^ i<k,i^2 
„ AT A: 

}-//°-(ivEn/.o^""-)d''' 

•^^ n=li=l 



and so we must also have 

N k 

^En/^°^""7^0' 

n=l i=l 

as required. □ 

Example An easily-generalized argument of Ziegler [32 ] in the case d = 1 and 
Pi = Oi G Z shows that there is always a unique minimal characteristic factor 
tuple: a characteristic tuple of factors : X — )• Yj, i = 1,2, ... ,k, such that 
any other characteristic tuple of factors : X — )■ must satisfy ^ for all 
i < k. However, it is worth noting that the members of this tuple can depend on 
the whole system X, in that if we restrict to averages involving functions fj that 
are all lifted from Yj for some fixed i, then Yj may in turn admit a characteristic 
tuple of smaller factors. 

For example, when d = 2 and p/ = for i = 1, 2, consider three irrational and 
rationally independent points on the circle r,s,t G T, and let X be the system 
on {X, fi) = (T^, mj2) generated by Ti := Rs x Rt and T2 := Rr x Rg. In this 
simple setting we can use Fourier analysis to obtain that the minimal characteristic 
factors ^1, ,^2 are equivalent to the first and second coordinate projections — )• T 
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respectively. However, after passing down through the first coordinate projection, 
it is equally easy to compute that the minimal characteristic factors of the resulting 
system on (T, mj) are both trivial. 

It follows that there is in general no tuple of idempotent classes of systems Ci, C2, 
. . . , Cfc such that Yj = C^X for every X. This contrasts interestingly with the case 
d = 1, where the main technical result of Host and Kra [21] and Ziegler [32] can 
be phrased as asserting that there is such a class, and for k distinct integers pi, P2, 
. . . , Pfc € Z we have Ci = C2 = • • • = Cfe and it is the class of all 'direct integrals' 
(suitably defined) of inverse limits of /c-step nilsystems. < 

Let us finish by recording the following useful property of minimal characteristic 
factors. 

Lemma 4.4. (X(-^))m,>o. i'^^(^)m>k>o ^'^ inverse system with inverse limit 
^(oo)> (^{m))m>o cind the factors of the minimal characteristic tuples of these 
systems for some averaging scheme are ^(rn),i> ^ = 1,2,. . . ,k and ^(oo),i. ^ = 
1,2, ... ,k respectively then 

C(oo),j = V C(m),i o V'(m)- 
m>0 

Proof The direction ^ is obvious (since any particular nonconventional averages 
on system X(^) can be lifted to X(oo)), so we need only show the reverse contain- 
ment. 

To this end, suppose that /j e L°° (/i(oo) ) for z = 1, 2, . . . , A:. Then by the definition 
of the inverse limit, we know that we can approximate these functions arbitrarily 
wellinL^(/i) by functions of the form (jij o ^^^-j for gfj G with ||(7j||oo < 

1 1 /ill 00 and m sufficiently large. This approximation now clearly gives 

5'(oo),Af(/l,/2,- • -Jk)^ S(^rn),Nigi,92, ■ ■ ■ , Ok) ° ^(m) inL^(^) 

uniformly in N, and this latter behaves asymptotically as 

'S'(m),Af (E/i(„) (gi I C(m),l)> E^(„) (92 I C(m),2)> • • • i E^(„) idk I ({m),k)) ° V'(m) 
= 'S'(oo),Ar(E/.(^) iai ° i^{m) I C(m),l ° V'(m)), E^(oo) (^2 ° V'(m) I C{m),2 ° V'(m))> 

• • • ' ^Ho^) i9k ° V'(m) I C{m),k ° ^(m))) 

in L2(/i) as iV oo, by the defining property of ?(m),2> ■ ■ ■ > C(m),k- 

This, in turn, is approximately equal to 

'S'(oo),Ar(E/i(^) (/l I S.{m),l°i'{m)), ^fi^ao) I ^(m) ,2°"^ (m)) ■, ■■■ , E^^^j (/fc | 6.(m),k°1p{: 
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and so letting m oo and observing that each (E^^^j (/,, | ^(^) ^ o V'(m)))m>i for 
i = 1,2, . . . ,dis a. uniformly bounded martingale, we obtain 

5'(oo),Ar(/li /2, ■ ■ ■ ,fk) 

~ 5'(oo),Af(E/x(oo)(/l I ^l)' EAt(oo)(/2 I • • • ' E^(oo)(/fe I ^fc)) 

as A'' — 00 with 

m>0 

and hence ^(oo),i — 4°' required. □ 

4.2 Linearly independent directions in discrete time 

In this section we will address the easier of the questions posed in the introduction: 
whether we can construct pleasant extensions while retaining the existence of roots 
for our transformations. In fact it will follow quite easily from the machinery 
developed above that FIS extensions achieve this goal. 

Proposition 4.5 (FIS extensions are pleasant). If (X, /i, T) is an FIS system and 
pi, p2, . . . , Pk € Z"^ are linearly independent then the tuple of factors 

je{i,2,...,fc}\{i} 

is characteristic for the associated linear nonconventional averages 

N k 

n=l i=l 

The case in which in addition Zpi + Zp2 + • • • + Zp^ = Z'^ (so necessarily k = d) 
is implicitly contained in |4|; the point here is to handle the case when the pj 
generate a proper sublattice. 

Before turning to Proposition 14.51 we illustrate our basic method by proving the 
following useful lemma (which is, in turn, implicitly contained in 131). For our 
purposes a tuple of isomorphic embeddings pj : TH'i U^, j = 1, 2, . . . , A: is 
totally linearly independent if 

Piini) + p2{n2) + . . . + Pkink) = £ Z'^ ^ nj = £ Z''^ < k. 
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Lemma 4.6. If (X, jjl, T) is an FIS system then whenever pj : ^ Z*^ are 
totally linearly independent isomorphic embeddings for j = 1,2, ... ,k we have 



j=2 j=2 

where T^i is the 1/i -action n ^ TP^i'^l 

Proof Let A := ) + P2{Z''^) + ■■■+ Pki^''") < Z"^. We first suppose A = 

Z^, and then use this to prove the general case. Note that since our pj are injective 
and linearly independent, in this special case they together define an isomorphism 

It is clear that for any system {X, T) we have 

j=2 j=2 

SO we need only prove the reverse containment. Let 

(X,fi) := {X x^TPi X,ii®^TPi IJ) 

and vTi and tt2 be respectively the first and second coordinate projections X ^ X, 
and define a Z'^-action T on {X, /i) by setting 



k 



fPii-) =/ TP^^'^ xidx ifi = l 
• \ (Tx2)Pi(-) ifi = 2 



2,3, . . . , k. 

and extending additively. This is easily to seen to be a well-defined probability- 
preserving Z'^-system and an extension of {X, /i, T) through vri. Now note that the 
whole second coordinate in X is T^^ -invariant, and hence that 

k 



j=2 j=2 



\/cr''t{\jcr)on2 



3=2 ' ' 3=2 

where the last equivalence holds because C,^^^ o vri ~ C,^^^ o by construction. 
On the other hand since {X, T) is FIS the factors Vj=2 Co^'' ^''^ must be 
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relatively independent over 



i=2 j=2 



SO in fact we have 



and hence 



as required. 



i=2 j=2 
i=2 j=2 



Finally, for a general A Corollary 13. 191 tells us that the subaction system X.^^ is 
still FIS, and so since all joins of the idempotent classes Zq' are insensitive to 
the forgetful functor to A-subactions the special case treated above completes the 
proof. □ 



Proof of Proposition 1431 Let A := Zpi + Zp2 H h I^Pk- 

Once again we first treat the case A = Z'^; this is already covered in lU in slightly 
different terms, and the underlying idea here is as in that paper. Write Tj := TP* 
and fix some j < k. Consider the extension vr : X — X built from the Furstenberg 
self -joining by 

. letting {X,fi) := 

• defining the lifted transformations Tj by 

Ti X Ta X • • • X Tfc for z = j 

Tt' foriE{l,2,3,...,A:}\{i}, 

• writing vTj : — )• X, i = 1, 2, . . . , /c, for the coordinate projections, 

• and taking tt := ttj. 

Now let fi G L°°(/i) for z = l,2,...,k and let g E be fj-invariant. 

Observe from the above choice of the lifted transformations that fi o vTj is a fifj'^- 
invariant function on X for each i ^ j, and so the function ni<fe, i^j 
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X is (Co^ V V 

i<k, iT^j^o"^ ) -measurable. Since X is FIS, under fi^ this function 
is relatively independent from fj o ttj over (Cq^ V \/ ^^^ j^^j 

Q ) ° TTj-: that IS, 

writing := C? V Vi<fc,i^j ^ > we have 




Using this argument to replace fj with E^(/j | ^j) for each j in turn. Lemma |4!2] 
tells us that {^1,^2, ■ ■ ■ ,^k) characteristic, as required. 

Now if A is a general sublattice, we observe that for a given tuple of factors of our 
FIS system X, their characteristicity depends only on the subaction system Xl^"^, 
which is also FIS by Corollary 13.191 and so as for the preceding lemma the above 
special case completes the proof. □ 

4.3 An example in continuous time 

In addition to the above description of pleasant extensions for certain linear av- 
erages (by itself only a very modest generalization of technical results from [41), 
we will now offer an application of sated extensions to a different convergence 
problem for nonconventional averages. This problem is 'quadratic' and 'two- 
dimensional', which features introduce new difficulties, but it is also in 'continuous 
time', and we will find that this allows us to recover a fairly short proof. 

Thus, we now switch to the setting of a jointly measurable action M? r> {X,ij,), 
which we denote by — )• Auto{X, fj,) : v 1— )• r^. We also let ei, 62 be the 
standard basis of M^. 

Theorem 4.7. The averages 



^ Jo 

converge in LP'{^j) as T ^ 00 for any /i, /2 € L°°(fi). 

As in |4 | this will follow once we ascend to a suitable pleasant extension. 

Proposition 4.8 (Pleasant extensions for continuous-time quadratic averages). If 

the M?-system {X,p,,T) is sated for the idempotent class Z^^^ V Zq''^ then the 
factors 

?i - ?2 •- Co V Co 
are characteristic for the above averages. 
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Proof of Theorem 14.7 1 from Proposition 14.81 Given Proposition l4.8l it suffices to 
consider the averages Srifi, f2) with each fi measurable with respect to Co ^^"^ V 
Q ^^"^ . By a simple approximation in (/i) and multilinearity, the convergence 
of these follows in turn if we know it when fi = gi- hi for some gi , g2 that are 
r'^'*^^ -invariant and hi, /12 that are r^^'^^ -invariant. 

Substituting this form into the definition of St, and using first the invariance of gi 
and then that of hi , we are left with the averages 

StUu f2) = 51 4 /^(^i ° ^''^{{92 ■ h2) o r*'-i+*-^) dt 

= gi-^l\hi-g2-h2)or''-^+'-'^ dt, 

and these latter are now conventional polynomial ergodic averages, which converge 
in L'^ifJ-) simply by spectral theory and the corresponding result for the scalar av- 
erages y exp(27r(ai^ + bt)i) dt, whose convergence follows from the classical 
scalar- valued van der Corput estimate (or, indeed, in the only nontrivial case a 7^ 0, 
from a change of variables and the classical evaluation of Fresnel integrals). □ 



Remark Note that Proposition 14.81 is formulated in terms of satedness with re- 
spect to a single idempotent class, rather than by appeal to a continuous analog 
of the blanket notion of full isotropy satedness (Definition 13.131 ). This is because 
the continuous group M? has uncountably many subgroups, and so we should need 
in turn an analog of Theorem 13.1 II that allows for uncountably many idempotent 
classes. This is impossible in general without leaving the class of standard Borel 
spaces (although presumably the class of 'perfect' measure spaces, in the sense of 
Section 451 of lfT3l . is still large enough), and so would entail a barrage of new 
technical measure-theoretic details that we prefer to avoid. □ 



Proof of Proposition 14.81 This proof that starts with an important initial twist. 
We first note that we may change variables in the integral 



T 

to u := t"^, and so obtain 
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where ^ 

S'u{fij2) := I {hoT^-^){f2 0T^-^+^-^)du. 
Jo 

Thus, this change of variables has revealed that the averages 5t(/i, /2) are actually 
'smoother' than the averages ^^(/i, /2), which involve only linear and sub-lineai" 
exponents. In spite of of the non-integer power ^/u that has now appeared, we 
will now see that these are quite simple for our purposes. (This crucial trick was 
pointed out to me by Vitaly Bergelson.) 

To complete the proof we show that 

As usual, this begins with the van der Corput estimate (in its version for continuous 
families of vectors, which is exactly analogous to the discrete setting: see, for 
example. Section 1.9 of Kuipers and Niederreiter ll23l ). which after a little re- 
arrangement gives that 

5[7(/l,/2) AO 

•((/2 O r''ei+(v^-0I)e2 . o ^«ei+v/^Ie2) dfldudh 

•((/2 o t''''i+{^«+^-^)'=2 . o r^^2) dfldudh A 
as [/ — )■ 00 and then i7 — )• 00. 

The important feature here is that for each fixed h we have 
Vu + h — \fu — )• as li — )• cx), 
and hence by the strong continuity of r it follows that 

11/2 o ^'^ei + (v^-V^)e2 _ o ^/lei ||^ _^ q 
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as u ^ oo. From this it follows that for any fixed h we have 

Tr r f (/i ° ^'"'^ • ° r'^«i+(^-V^)-2 . /2) o d/xdn 

Jo 

~ ^ / / (/i o r'^'^i • /i)((/2 o r'^'^i • /2) o T^-^) d^dtx 

^ / (/i o r'^-i • /i)E^(/2 o r'^'^i • /2 I Co"'"^ ) dM as [/ ^ oo 
Jx 

= / (/l O r'^^l • /l) ® (/2 O r'^^l • /2) d(/X ® tMe2 /x) 

(this crucial simplification resulting from our change-of-variables was pointed out 
to me by Vitaly Bergelson). Now letting h ^ oo this simply tends to 



if I ® /2) • 5 d(/X (8) tRe2 n) 

for the (r®^)'^'*'^^ -invariant function 

Hence, letting {X, fi) = {X^,iJ, ^ tme2 l^), letting tt : X ^ X and lifting r to the 
action f defined by 

~sei+te2 ._ ^sei+te2 ^ ^sei 

(noting that /x (8) t»e2 is also invariant under the flow t idx x t^^'^), we see 

^0 

that we have found an extension of {X, /x, r) in which 

EaI/iottICo VCo Jt^O, 

and hence by satedness the analogous non-vanishing must have held inside the 
original system {X, iJ,,T),as required. □ 

Remark Although rather simple, it is worth noting that the use of satedness 
still played a crucial role in the above proof. Without the initial assumption of 
satedness, the above appeal to the van der Corput estimate combined with con- 
siderations of the structure of the Furstenberg self -joining tell us that for the av- 
erages Sjj{fi,f2) the pair of factors ^i, ,^2 is characteristic, where coordi- 
natizes the maximal r^'^'^' -isometric extension of the isotropy factor Co ^^ ^'^ 
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This much can be argued following the same lines as Conze and Lesigne's ini- 
tial analysis in |[8] |9l [lOl of double linear averages in discrete time for some sys- 
tem T : I? r\ {X,fi). Thus, allowing ourselves to assume that each fi is ^i- 
measurable, it now follows that each fi may be approximated by a function resid- 
ing in a finite-rank r^'*'^* -invariant module over the factor in Conze and 
Lesigne's setting (with T and Z in place of r and M) this leads directly to a proof of 
convergence, because when written in terms of unitary cocycles describing these 
finite-rank modules the double linear- averages become simply averages for some 

new 'combined' finite rank module over the single system T\ j,fz(e^-e2) , to which 

Co 

the usual mean ergodic theorem can be applied. However, in our setting matters 
are not so simple, since even after approximating and then using a representation 
in terms of unitary cocycles in this way, the expression that results still involves 
two different polynomials in the exponents, and so it is not clear how to realize 
it as some kind of more classic ergodic average. Once we assumed satedness, 
this problem vanished because the structure of a finite-rank r^*''' -invariant module 
over ""^^ is replaced by that of the factor Q^'^''' v ^j^'*'''^ ''^^ ^ for which more 

explicit simplifications to our averages are possible, as exhibited above. < 

It would be interesting to know how far into the world of polynomial nonconven- 
tional averages in continuous time one could penetrate using the method of sated 
extensions together with the important change-of-variables trick shown above. I 
strongly suspect that in this setting it would at least be possible to prove conver- 
gence for a much wider range of averages involving quadratics in the exponents. 
However, this seems to require the organization of a new induction scheme that 
allows for working with fractional-power polynomials (as a result of the changing 
of variables), and this would take us too far from the setting of discrete averages 
that the present sequence of papers aims to examine, so we will not pursue this 
further here. 

5 Next steps 

This paper has begun to showcase the far-reaching consequences of satedness in 
the study of nonconventional ergodic averages, but its larger purpose is to pre- 
pare the ground for its sequel [2J. There we will turn to nonconventional averages 
which require rather more elaborate new arguments. Indeed, only after several 
more technical steps will we be able to address even one new case of convergence 
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for polynomial averages in discrete time: that of 



N 



T7 E(^i ° o TfT^) for (ri,T2) : r> /i,/2 € L-(/i) 



(as in Theorem 1 1 .41 above). 

The analysis of these will rely heavily on some auxiliary results concerning the 
triple linear averages 



for some action T : I? r\ {X,fi) and three directions pi, p2, Pa S enjoy- 
ing some linear dependence. Such linear averages arise naturally from the above 
quadratic averages upon a single application of the van der Corput estimate, and so 
we will construct a useful notion of pleasant extension for the quadratic averages 
by first developing such a notion for these triple linear averages and then showing 
how the resulting characteristic factors can be simplified further. 

The point is that although convergence is known for triple linear averages such as 
the above, the use of extensions to prove this in [4] forgets the linear dependence of 
the Pi, effectively replacing the TP» with three independent commuting transfor- 
mations on the extended system. We cannot afford this freedom in the study of the 
quadratic averages, because after passing to such a Z'^-system it is not clear how 
the quadratic averages of interest can even be sensibly interpreted. Our main task, 
therefore, will be to see how simple a triple of characteristic factors can be obtained 
for the above linear nonconventional averages while preserving the algebraic rela- 
tions of the original Z^-action. It will turn out that we can do quite well, except that 
in addition to the factors that contribute to each in Theorem 11.11 we must now 
involve some systems on which our Z^-action is by commuting rotations on a two- 
step nilmanifold, thus re-establishing contact with earlier works such as 112X1 l32l 
and their forerunners in which the relevance of these was made clear in the setting 
of Z-actions. 
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